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Abstract — A  novel  decision-estimation  architecture  for  a 
team  of  agents  cooperating  under  communication  imperfec¬ 
tions  is  presented.  The  scenario  of  interest  is  that  of  a  group 
of  uninhabited  aerial  vehicles  performing  cooperative  task 
assignment  under  communication  delays.  In  the  proposed 
architecture,  each  UAV  in  the  group  runs  multiple  filters  in 
parallel  on:  its  own  states,  teammates’  states,  and  its  states 
as  viewed  by  teammates.  The  estimation  of  team  members’ 
states  allows  each  UAV  to  synchronize  the  transmitted  cost  of 
performing  known  tasks,  obtained  from  the  different  group 
members,  to  a  common  time  base.  It  also  enables  estimating 
the  expected  cost  for  teammates  to  prosecute  new  tasks.  Thus, 
the  group  performance,  under  communication  imperfections, 
can  be  improved.  For  the  estimation,  two  different  algorithms 
are  proposed.  The  first  is  communication  efficient  in  which 
asynchronous  information  updates  are  sent  to  the  network  by 
individual  members  based  on  the  value  of  the  information 
to  the  rest  of  the  group.  Taking  into  account  that  the 
plan  and  plant  of  each  UAV  are  known  to  the  rest  of  the 
group,  improves  the  overall  estimation  process.  Moreover,  it 
allows  proposing  another,  computationally  efficient,  estimation 
algorithm  utilizing  synchronous  information  updates. 

I.  Introduction 

Uninhabited  aerial  vehicles  (UAVs)  can  be  used  for 
various  civilian  and  military  tasks.  On  top  of  the  possibility 
of  performing  tasks  without  putting  human  life  in  harm’s 
way,  the  lack  of  a  human  pilot  allows  significant  weight 
savings  and  enables  carrying  out  missions  with  unique  char¬ 
acteristics,  such  as  extremely  long  duration.  Other  important 
aspects  are  allowing  new  operational  paradigms  and  being 
a  force  multiplier  with  minimal  or  no  human  intervention. 

To  realize  these  advantages,  the  UAVs  must  have  a  high 
level  of  autonomy  and  preferably  work  in  groups.  Ex¬ 
changing  information  within  the  group  is  expected  to  syn¬ 
ergetically  improve  the  group’s  capability.  In  recent  years 
numerous  algorithms  for  performing  collaborative  specific 
tasks  such  as  cooperative  search  [1]  and  classification  [2] 
have  been  proposed.  For  solving  cooperative  multiple  task 
assignment  problems,  involving  for  example  classification, 
attack,  and  kill  verification  on  multiple  targets,  emerging  al¬ 
gorithms  of  different  classes  have  been  proposed,  including: 
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mixed  integer  linear  programming  [3],  [4],  iterative  capac¬ 
itated  transhipment  problem  (CTP)  algorithm  [5],  iterative 
auction  [6],  [7]  and  genetic  algorithms  [8]. 

Assuming  perfect  information  flow,  the  above  mentioned 
algorithms  can  be  implemented  in  a  redundant  centralized 
manner  in  which  each  vehicle  computes  the  cost  for  all 
vehicle-task  combinations.  However,  in  a  realistic  scenario 
communication  constraints  are  expected.  Information  flow 
imperfections,  such  as  communication  delays,  may  pro¬ 
duce  different  information  sets  for  the  different  UAVs  in 
the  group  leading  to  multiple  strategies.  The  result  can 
be  uncoordinated  assignments,  such  as  multiple  vehicles 
wrongly  assigned  to  perform  the  same  task  on  a  certain 
target,  leaving  other  tasks  unassigned  [9]. 

Decentralized  implementation  of  the  decision  algorithms, 
such  as  iterative  CTP  and  iterative  auction,  may  reduce  the 
sensitivity  to  communication  imperfections  [9].  In  such  an 
approach,  each  vehicle  computes  only  its  own  cost  to  prose¬ 
cute  available  tasks;  and  communicates  this  information  to 
the  rest  of  the  group.  A  problem  arises  of  synchronizing 
these  costs  to  a  common  time  base.  In  this  paper  we 
propose  solving  this  problem  by  letting  the  UAVs  also 
communicate  information  regarding  their  state.  Thus,  each 
UAV  can  employ  an  estimation  algorithm  on  all  of  the  team 
members’  states  enabling  a  time  update  of  the  individual 
received  costs. 

For  the  estimation  process  data  need  be  fused  from 
several  sources.  Data  fusion  topologies  can  be  classified  into 
three  types:  central,  decentralized,  and  hierarchical.  In  the 
central  topology  one  agent  performs  the  data  fusion  of  the 
information  obtained  from  the  different  sensors.  Thus,  all 
measurements  must  be  communicated  to  the  central  unit,  in¬ 
volving  high  computational  burden  on  that  unit.  In  contrast, 
the  decentralized  topology  is  based  on  local  fusion  in  each 
agent,  of  information  communicated  in  the  network.  The 
decentralized  topologies  rely  on  communication  between 
nearby  agents.  Thus,  the  communication  messages  are  in¬ 
dependent  of  the  entire  group  size.  This  attribute  allows 
scalability  of  the  decentralized  system  to  large  group  sizes. 
The  hierarchical  topology  is  a  hybrid  of  the  two  former 
ones.  In  this  paper  decentralized  collaborative  estimation 
algorithms,  featuring  highly  reduced  communication  and 
computational  load,  are  presented. 

The  remainder  of  this  manuscript  is  organized  as  follows: 
In  the  next  section  the  UAV  cooperative  decision  problem 
under  information  flow  imperfections  is  posed.  It  is  fol¬ 
lowed  by  the  synthesis  of  computation  and  communication 
efficient  decentralized  estimation  algorithms.  A  simulation 
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study  showing  the  performance  of  the  new  estimation  algo¬ 
rithms  is  then  presented.  Concluding  remarks  are  offered  in 
the  last  section. 

II.  UAV  Cooperative  Decision  Problem 

The  cooperative  UAV  task  assignment  problem  in  scenar¬ 
ios  such  as  wide  area  search  and  destroy  (WASD)  [5]  and 
combat  intelligence  surveillance  and  reconnaissance  (ISR) 
serves  as  the  background  for  the  decision  problem  addressed 
in  this  paper.  We  first  describe  the  centralized  cooperative 
decision  problem  and  then  present  a  decentralized  version 
of  it.  The  effects  of  communication  imperfections  and 
the  proposed  modifications  using  decentralized  estimation 
algorithms  are  then  discussed. 

A.  Task  Assignment  Problem 

Let  T  =  { 1.2. ....  Nt }  be  the  set  of  targets  found  and 
let  V  =  {1,2 ,...,NV}  be  a  set  of  UAVs  performing  tasks 
on  these  targets.  The  goal  of  the  task  assignment  algorithm 
is  to  make  efficient  use  of  the  resources  v  C  V,  while 
prosecuting  all  the  tasks  on  all  the  targets.  For  example, 
tasks  associated  with  the  WASD  mission  are  search,  clas¬ 
sify,  attack,  and  verify.  Since  the  default  task  for  unassigned 
vehicles  is  searching,  it  is  not  included  in  the  assignment 
problem.  Note  that  the  multiple  tasks  per  target  (classify, 
attack,  and  verify)  must  be  performed  in  order,  e.g.  a  target 
must  be  classified  before  it  can  be  attacked.  There  are 
many  different  cost  functions  that  can  be  minimized  in  the 
task  assignment  problem  such  as  total  distance  travelled  by 
vehicles,  or  the  time  required  to  prosecute  all  of  the  targets. 
In  this  study  the  total  accumulated  cost  for  all  of  the  vehicles 
to  perform  all  of  the  tasks  on  all  of  the  targets  is  used  as 
the  cost  function.  This  assignment  problem  is  posed  as  a 
combinatorial  optimization  problem  in  [8], 

B.  Decentralized  Task  Assignment  Problem 

As  discussed  earlier  many  different  solutions  to  the  task 
assignment  problem  have  been  proposed  [3]— [8],  Being 
single  assignment  algorithms,  running  multiple  times  for 
the  multiple  tasks,  the  iterative  CTP  and  auction  algorithms 
are  the  natural  candidates  for  decentralized  implementa¬ 
tion;  promising  reduced  sensitivity  to  communication  im¬ 
perfections.  In  each  iteration  every  vehicle  computes  the 
cost  of  performing  the  available  tasks,  based  on  its  own 
information.  This  cost  is  denoted  Cj.fc  where  i  £  V  and 
k  £  T.  When  one  iteration  is  completed,  any  tasks  left  over, 
including  follow-on  tasks,  are  moved  to  the  next  iteration. 

In  the  process  of  calculating  costs,  the  vehicles  rely  on 
internal  and  external  information.  Thus, 

Ci,k  fi 5  (1) 

where  xt  represents  the  vehicle’s  internal  states  and  xe 
represents  the  external  ones.  The  internal  states  are  defined 
as  those  that  only  the  vehicle  has  access  to  ( i.e .  information 
that  is  not  explicitly  communicated  to  others)  such  as  fuel 
levels,  weapons  status,  etc.  To  the  external  states,  such  as 
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Fig.  1  :  Cost  matrix  example. 


the  vehicle’s  position  and  it’s  targets,  the  other  vehicles  have 
access  to.  Note  that  the  function  /,  is  known  to  all  team 
members. 

When,  for  example,  the  iterative  CTP  is  decentrally 
solved,  costs  from  all  of  the  vehicles  are  formed  into  a 
matrix.  Each  row  corresponds  to  the  cost,  transmitted  from 
one  of  the  vehicles,  to  perform  all  the  available  tasks.  In 
Fig.  1  such  a  matrix  is  presented  for  a  scenario  between 
3  UAVs  and  3  targets.  Note  that  each  row  includes  also  a 
time  stamp,  indicating  the  time  of  when  the  computation  of 
Eq.  1  has  been  performed  by  the  representative  vehicle. 

C.  Dealing  with  Communication  Delays 

In  this  study  we  assume  a  stationary  known  environment 
(fixed  known  targets)  and  that  the  communication  imper¬ 
fections  cause  uncertainty  on  teammates  position  and  their 
respective  updated  costs.  We  utilize  the  important  difference 
between  internal  and  external  states  such  that  the  external 
ones  can  be  estimated  by  the  other  vehicles  while  the 
internal  ones  may  not.  We  also  assume  that  the  internal 
states  do  not  change  in  the  time  frame  of  importance. 
Thus,  we  concentrate  here  on  estimating  the  position  of 
teammates.  Having  estimates  about  the  current  location  of 
teammates  will  enable  synchronizing  individual  costs  (rows 
in  the  decision  matrix)  from  the  different  members  to  a 
common  time  basis,  thus  robustifying  the  group  decision 
process  with  respect  to  communication  delays.  Moreover, 
it  will  allow  each  member  to  estimate  the  costs  of  the 
other  vehicles  to  new  targets  based  on  prior  communi¬ 
cated  costs  and  information  on  their  external  states.  This 
makes  it  possible  to  produce  preliminary  assignments  prior 
to  communicating  with  the  other  vehicles.  The  resulting 
preliminary  assignment  can  be  used  to  direct  individual 
vehicles  to  expected  trajectories,  until  all  the  costs  for 
teammates  to  prosecute  the  new  targets  have  been  received 
and  the  assignment  algorithm  has  been  re-run. 

III.  Cooperative  Estimation 

In  this  section,  first  the  general  concept  of  decentralized 
estimation  is  discussed  and  then  two  such  algorithms  are 
derived  for  the  investigated  problem.  One  of  the  algorithms 
is  computationally  efficient  while  the  other  is  communica¬ 
tion  efficient.  Both  algorithms  utilize  the  information  form 
of  the  Kalman  filter. 

A.  Decentralized  Estimation  -  Brief  Review 

Decentralized  estimation  is  at  the  heart  of  many  multi¬ 
sensor  systems.  It  is  composed  of  multiple  agents  having 
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local  information,  that  is  shared  through  a  communication 
network  with  neighboring  agents;  and  then  fused  by  differ¬ 
ent  agents  with  their  own  local  information.  Algorithms  for 
the  optimal  fusion  of  received  information  with  local  infor¬ 
mation  in  such  a  distributed  network  have  been  presented 
in  [10].  In  such  algorithms  local  sensor  data  is  processed 
to  generate  a  local  estimate  that  is  then  transmitted  to  other 
agents.  The  basic  philosophy  is  that  each  agent  reconstructs 
the  optimal  estimate  as  if  all  sensor  measurements  were 
transmitted  instead  of  local  estimates.  The  key  step  is  in 
identifying  the  new  information  in  the  communicated  local 
estimates  in  order  to  avoid  double  counting. 

A  well-known  technique  in  data  fusion  algorithms  is  the 
information  filter  (IF)  [11].  It  is  algebraically  equivalent 
to  the  Kalman  filter  (KF)  with  a  computationally  simpler 
update  stage  at  the  cost  of  increased  complexity  in  the  time 
update  stage.  In  this  subsection  the  IF  is  summarized. 

Consider  the  following  linear  system  model: 

xk+i  =  &kxk  +  Bkuk  +  rfcwfc,  xk  G  R"  (2) 

where  {uk},u  G  Rz  is  the  input  vector  and  {ujk},uj  G 
Rp  is  a  zero  mean  white  sequence  with  positive  semi- 
definite  covariance  matrix  Qk  (process  noise).  The  sensor 
observation  is 

zk  =  Hkxk  +  vk,  zk  G  Rm  (3) 

where  {vk},v  G  Rm  is  a  zero  mean  white  sequence  with 
positive  definite  covariance  matrix  Rk  (measurement  noise), 
{wfc},  {ffc},  and  x$  are  assumed  to  be  mutually  independent 
for  all  times. 

The  KF  algorithm  generates  estimates  for  the  state, 
denoted  as  x,  together  with  a  corresponding  estimation 
covariance,  denoted  as  P.  The  IF,  being  a  representation  of 
the  KF  from  the  information  viewpoint,  uses  the  following 
definitions: 


*(■)/<■)  =  Pf)/(0 

(4) 

A 

y«/(-)  =  r(-)/(-):E(-)/(-) 

(5) 

Ik  =  HlR-[Hk 

(6) 

ik  =  HlR-lzk 

(7) 

where  Y  is  the  well-known  Fisher  information  matrix,  ij  is 
the  new  estimated  state  vector,  I  is  the  a  priori  expected 
information  held  in  each  measurement,  and  i  is  the  a 
posteriori  actual  information  held  in  a  single  measurement. 
With  these  definitions,  the  IF  can  be  summarized: 

Time  Update: 

Vk+i/k  =  [l  j  \yk/k  T  Yk/k*f*k  Bkuk ] 

(8) 

Yk+1/k  =  Mk  —  flkTkMk  (9) 

where 

Mk  =  ^TYk/k^k1  (10) 


=  Mkvk  [r^Mferfc  +  Qk *] 

and  1  is  the  identity  matrix  with  appropriate  dimension  and 
Qk  >  0.  If  Qk  —  0  then  Eq.  9  is  not  applicable  and 

lfc+1/tc  =  df/c  • 

Observation  Update: 

Vk+l/k+l  =  Vk+l/k  +  1  (12) 

Yk+i/k+i  =  Yk+1/k  +  Ik+1  (13) 

Note  the  additive  nature  of  the  observation  update  equa¬ 
tions,  making  the  IF  a  natural  candidate  for  decentralized 
estimation. 

B.  Estimation  Model 

For  obtaining  a  realtime  solution  of  the  cooperative  task 
assignment  problem  it  is  usually  assumed  that  the  UAVs 
fly  at  a  constant  altitude  and  speed.  For  our  derivation  of 
a  cooperative  estimation  algorithm  we  will  make  similar 
assumptions.  Thus  we  use  the  Dubins  car  model  [12]  for 
representing  the  planar  dynamics  of  the  vehicles 


x  =  v  cos  6 

(14a) 

y  =  v  sin  9 

(14b) 

9  —  Qmax  tl 

(14c) 

v  =  0 

(14d) 

where  x  and  y  are  the  UAV  horizontal  coordinates  in  a 
Cartesian  inertial  reference  frame;  9  is  the  azimuth  flight 
angle;  v  is  the  speed;  and  f lmax  is  the  maximum  turning  rate 
of  the  vehicle  (travelling  at  the  speed  of  r>).  It  was  shown 
[12]  that  the  optimal  trajectories  of  vehicles  having  the 
dynamics  of  Eqs.  14  consist  of  straight  lines  and  arcs  with 
radius  Rmin  =  v/V,max.  More  specifically,  these  optimal 
trajectories  are  from  two  different  families:  1)  turn-straight- 
turn,  2)  turn-turn-turn.  Let  us  assume  that  all  vehicles  abide 
to  these  strategies.  Thus 

uG  {-1,0,1}  (15) 

We  assume  that  each  UAV  has  measurements  only  on  its 
own  position.  Thus  the  measurement  equation  is 

Zk=  ^  +Vk  (16) 

and  G  R2  is  a  zero  mean  white  sequence  with 

positive  definite  covariance  matrix  Rk.  we  denote  the  mea¬ 
surement  rate  as  /j. 

The  equations  of  motion  for  the  UAVs,  given  in  Eqs.  14, 
are  clearly  nonlinear.  For  the  estimation  process  we  will  be 
using  the  following  state  vector 

x  =  [x  y  vx  vy]T  (17) 

with  the  dynamics 

x  =  Ax  +  B(x)u  +  Gu> 
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where 
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0 

0 

^ max'Vx 

0 

1 

(19) 

We  assume  that  each  UAV  knows  the  control  actions 
of  his  teammates,  since  they  are  all  acting  based  on  the 
same  cooperative  team  task  assignment  plan.  Note  that  in 
the  above  representation  of  the  equations  of  motion  the 
dynamics  matrix  A  is  constant  while  B  is  state  dependent. 
Thus,  A,  and  also  G,  are  identical  to  all  vehicles,  enabling 
the  computationally  efficient  algorithm  presented  next. 


C.  Computationally  Efficient  IF 

In  this  subsection  we  assume  that  each  UAV  receives 
synchronous  updates  from  teammates  at  a  constant  rate 
/2  <  fi-  The  transmitted/recieved  data  can  be  the  raw 
measurements;  better  yet  it  can  be  the  new  information 
gained  since  the  last  transmission.  Both  will  be  discussed 
next. 

We  will  denote  the  estimation  of  the  state  vector  of  UAV 
j  £  V  by  UAV  i  £  V  as  x1'3  and  correspondly  the  IF 
state  vector  is  denoted  y1'3 .  Note  that  xt,t,  represents  the 
estimated  state  vector  of  UAV  i  £  V  by  itself,  based  on 
the  information  transmitted  to  teammates.  The  estimation 
of  ones  own  state,  based  on  all  available  measurements, 
is  denoted  xl.  The  distinction  between  x 1  and  x1’1  will 
become  clear  in  the  sequel. 

The  equations  of  the  Nv  IFs  run  by  each  UAV  i  £  V, 
based  on  the  communicated  information,  at  a  rate  of  /2, 
are: 

Time  Update: 

vii  i/fc  =  [i  - 

BkrtfiT'y&ri]  V  3£V  (20) 

Ykii/k  =  Mfc  -  nkTlMk  (21) 

where  and  Fife  are  computed  based  on  Eq.  10  and  Eq. 
11,  respectively;  4?^,  Bk  and  T/..  are  the  discrete  versions 
of  A,  B  and  G,  respectively;  and  uj,  is  the  control  action  of 
UAV  j  £  V  known  to  UAV  i  £  V  (the  one  performing  the 
estimation  process),  since  it  is  assumed  that  all  the  UAVs 
abide  to  the  same  task  assignment  plan. 

Observation  Update: 


Vkii/k+i  =  V'k+i/k  +  *fc+i  V  3GV 


1k+l/k+l  —  1  k+l/k  ' 


iJ 

1k+ 1 


(22) 

(23) 


Note  that  since  the  quality  of  information  obtained  from 
each  UAV  is  assumed  identical  then  Eqs.  21  and  23  can  be 
computed  only  once  for  all  filters;  thus,  reducing  consider¬ 
ably  the  computational  effort. 

Each  UAV  also  runs  another  IF  on  its  own  states  using 
its  measurements  at  a  rate  of  fi  >  /2.  The  equations  for 


the  state  vector  y1  and  information  matrix  Y‘  of  such  filter 
are  identical  to  the  ones  given  in  Eqs.  8-13,  except  that 
Bk  is  state  dependent,  as  in  Eq.  20. 

The  transmitted  information  to  the  UAV  group  is  i3k  and 
IJk.  If  this  information  is  the  current  measurement  then  IJk 
and  ii  can  be  computed  based  on  Eqs.  6-  7,  respectively. 
However,  it  will  be  more  beneficial  to  send  all  the  gathered 
information  since  the  last  transmission.  Such  information 
can  be  computed  as 


(24) 


d  —  _  ,J<3 

lk  ~  i4-/fc  Hk/k 


(25) 


D.  Communication  Efficient  IF 

In  this  subsection  we  assume  that  each  UAV  re¬ 
ceives  asynchronous  updates  from  teammates.  The  trans¬ 
mitted/recieved  data  IJk  and  if.  is  based  on  the  quality  of 
the  current  state  estimate  of  UAV  j  £  V  by  UAV  i  £  V  as 
expected  by  UAV  j  £  V.  Thus,  the  information  is  sent  to 
the  group  members  by  UAV  j  £  V  only  if 

ej Eej  >  e  (26) 


where 


E  = 


1  0 
0  1 
0  0 
0  0 


x  k/k 


*k/k 


0  0 
0  0 
0  0 
0  0 


(27) 

(28) 


and  e  is  a  design  parameter;  and  E  was  chosen  so  that  the 
error  will  be  defined  between  the  position  estimates. 

The  equations  of  the  Nv  IFs,  run  by  each  UAV  i  £  V 
based  on  the  communicated  information,  are: 

Time  Update: 

VkU/k  =  [1  -  ■  fc]  < T\ytjk  +  Ytf&'Bk 

(1fc/fc_1Wfc/fc)“fc]  V  JGV 

Ytlt =  v  j£V 

where 


(29) 

(30) 


I'/  A /(/'I',,  • 


Qk 


-1 


-1 


V  jev 

(3D 

1  V  j£V  (32) 


Observation  Update: 


Vk+i/k+1  =y%k+i/k+ik+i  v  7'eU  (33) 

nii/k+^nti/k+n+i  v  j&v  ^ 

Note  the  significantly  larger  computational  complexity 
of  this  algorithm  compared  to  the  one  presented  in  the 
previous  subsection.  This  complexity  results  from  the  need 
of  calculating  Eqs.  30,  34  Nv  times  compared  to  only  once 
for  the  computation  efficient  algorithm  (see  Eqs.  21,  23). 
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Fig.  2:  Sample  run  estimation  performance. 

IV.  Algorithm  Implementation 

In  this  section  the  performance  of  the  proposed  esti¬ 
mation  algorithm  is  examined.  First,  the  performance  of 
the  computation  and  communication  efficient  estimation 
algorithms  is  studied  using  a  simplified  simulation.  Then, 
the  communication  efficient  algorithm  is  evaluated  using  the 
MultiUAV  simulation  [13]. 

A.  Simplified  Simulation 

First,  the  performance  of  the  developed  decentralized 
estimation  algorithms  is  evaluated  using  a  simplified  sim¬ 
ulation  of  the  vehicles’  dynamics,  based  on  Eqs.  14.  The 
three  examined  algorithms  are  denoted:  (i)  comp,  efficient 

1  -  raw  measurements  are  transmitted,  (ii)  comp,  efficient 

2  -  all  new  information  gained  from  last  communication 
is  transmitted,  (iii)  comm,  efficient  -  communication  is 
sent  based  on  the  information  measure  of  Eq.  26.  Sample 
performance  of  the  three  different  algorithms  is  presented 
in  Figs.  2-3  where  Fig.  3  presents  a  zoom  in  on  a  part 
of  the  trajectory.  In  this  sample  run,  the  UAV,  of  which 
the  trajectory  is  estimated  by  one  of  its  teammates,  per¬ 
forms  a  counter  clockwise  maneuver  followed  by  a  non- 
maneuverable  straight  line  flight.  It  is  apparent  that  the  best 
estimate  is  that  of  the  UAV  of  itself  while  that  of  algorithm 
(i)  performed  by  teammates  based  only  on  periodic  mea¬ 
surements  sent  from  that  vehicle,  at  a  low  rate  of  /2,  is 
the  worst.  The  best  estimate  by  teammates  is  achieved  by 
using  algorithm  (iii)  which  uses  information  sent  based  on 
its  need.  Note  also  from  Fig.  3  the  non-periodic  nature  of 
the  communicated  information  when  using  algorithm  (iii). 

B.  MultiUAV  Simulation 

The  performance  of  the  proposed  communication  effi¬ 
cient  estimation  algorithm  was  also  evaluated  using  the 
MultUAV  simulation  testbed  containing  high  fidelity  vehi¬ 
cle  dynamics,  inter-vehicle  communications,  target  sensors, 
targets,  and  threats.  In  the  simulated  scenario  two  UAVs 


Fig.  3:  Sample  run  estimation  performance  (zoom  in). 


Fig.  4:  Sample  trajectories  of  cooperating  UAVs. 


perform  multiple  tasks  on  two  targets.  The  UAVs  commu¬ 
nicate  information  with  a  known  delay  of  2sec.  In  Fig  4 
the  actual  and  estimated  trajectories  of  the  two  cooperating 
UAVs  are  plotted  where  the  filled  rectangles  represent  the 
position  of  the  two  targets  being  flown  over  by  the  two 
UAVs. The  asynchronous  observation  updates  are  evident. 
Note  that  in  this  example  the  total  distance  travelled  by 
the  UAVs  to  accomplish  all  the  tasks  was  approximately 
80Km.  For  comparison,  the  cost  for  the  case  of  perfect 
information  was  approximately  57Km,  while  that  for  a 
pure  delay  implemented  without  the  proposed  estimation- 
decision  scheme  was  approximately  lOOKm. 

V.  CONCLUSIONS 

A  novel  decision-estimation  architecture  for  a  team  of 
unmanned  aerial  vehicles  cooperating  under  communica¬ 
tion  imperfections  has  been  presented.  For  the  estimation 
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process  each  UAV  in  the  group  runs  multiple  filters  in 
parallel  on:  its  own  states,  teammates’  states,  and  its  states 
as  viewed  by  teammates.  The  estimation  of  team  mem¬ 
bers’  states  allows  synchronization  of  the  costs  obtained 
from  individual  UAVs  to  known  targets;  and  also  enables 
immediately  estimating  teammates’  costs  to  performing 
tasks  on  newly  found  targets.  Thus,  a  group’s  decentralized 
decision  process,  under  communication  imperfections,  can 
be  improved. 

Taking  into  account  that  the  plan  and  plant  of  each  UAV 
team  member  is  known  to  the  rest  of  the  group  improves 
the  overall  estimation  process.  Moreover,  it  allowed  to 
greatly  reduce  the  computational  complexity  of  the  pro¬ 
posed  algorithm  rendering  its  implementation  in  realistic 
scenarios  feasible.  It  was  also  shown  that  the  amount  of 
communication  within  the  team  can  be  tuned,  based  on  the 
value  of  the  information  to  the  receiving  end. 

References 

[1]  Flint  M.,  Polycarpou,  M.  and  Femandez-Gaucherand,  E.,  “Coopera¬ 
tive  Control  for  Multiple  Autonomous  UAVs  Searching  for  Targets,” 
Proceedings  of  the  41st  IEEE  Conference  on  Decision  and  Control, 
IEEE,  2002. 

[2]  Chandler,  P.  R.  and  Pachter,  M.  N.,  “UAV  Cooperative  Classifica¬ 
tion,”  Workshop  on  Cooperative  Control  and  Optimization,  Kluwer 
Academic  Publishers,  2001. 

[3]  Richards,  A.,  Bellingham,  J.,  Tillerson,  M.,  and  How,  J.  P.,  “Coor¬ 
dination  and  Control  of  Multiple  UAVs,”  Proceedings  of  the  AIAA 
Guidance,  Navigation,  and  Control  Conference,  Monterey,  CA,  2002. 


[4]  Schumacher,  C.  J.,  Chandler,  P.  R.,  Pachter,  M.,  and  Pachter,  L., 
“Constrained  Optimization  for  UAV  Task  Assignment,”  Proceedings 
of  the  AIAA  Guidance,  Navigation,  and  Control  Conference,  Provi¬ 
dence,  RI,  2004. 

[5]  Schumacher,  C.  J.,  Chandler,  P.  R.,  and  Rasmussen,  S.  J.,  “Task 
Allocation  for  Wide  Area  Search  Munitions  Via  Iterative  Network 
Flow  Optimization,”  Proceedings  of  the  AIAA  Guidance,  Navigation, 
and  Control  Conference,  2002. 

[6]  Bertsekas,  D.,  “Auction  Algorithms  for  network  flow  problems:  A  Tu¬ 
torial  Introduction,”  Computational  Optimization  and  Applications, 
Vol.  1,  1992. 

[7]  Chandler,  P.  R.,  Pachter,  M.,  Rasmussen,  S.,  and  Schumacher,  C., 
“Multiple  Task  Assignment  for  a  UAV  Team,”  Proceedings  of  the 
AIAA  Guidance,  Navigation,  and  Control  Conference,  2002. 

[8]  Shima,  T.,  Rasmussen,  S.,  and  Sparks,  A.,  “Multiple  Task  Assign¬ 
ments  for  Cooperating  UAVs  using  Genetic  Algorithms,”  Proceed¬ 
ings  of  the  American  Control  Conference,  2004. 

[9]  Mitchell,  J.  W.  and  Sparks,  A.  G.,  “Communication  Issues  in  the 
Cooperative  Control  of  Unmanned  Aerial  Vehicles,”  Proceedings 
of  the  Forty-First  Annual  Allerton  Conference  on  Communication, 
Control,  &  Computing,  2003. 

[10]  Chong,  C.  Y.,  Mori,  S.,  and  Chang,  K.  C.,  Distributed  Multi  Target 
Multi  Sensor  Tracking ,  Multitarget  Multisensor  Tracking:  Advanced 
Applications,  Y.  Bar-Shalom,  Editor,  Artech  House,  1990,  pp.  247- 
405. 

[11]  Maybeck,  S.  R,  Stochastic  Models,  Estimation,  and  Control,  Vol. 
1,  Vol.  141  of  Mathematics  in  Science  and  Engineering,  Academic 
Press,  1979,  pp.  238-241. 

[12]  Dubins,  L.,  “On  curves  of  minimal  length  with  a  constraint  on 
average  curvature,  and  with  prescribed  initial  and  terminal  position,” 
American  Journal  of  Math,  Vol.  79,  1957,  pp.  497-516. 

[13]  Rasmussen,  S.  J.  and  Chandler,  P.  R.,  “MultiUAV:  A  Multiple  UAV 
Simulation  for  Investigation  of  Cooperative  Control,”  Proceedings  of 
the  Winter  Simulation  Conference,  2002. 


4112 


